Network Data Mining: Depicting the Interaction between Attributes
نویسندگان
چکیده
Network Data Mining identifies emergent networks between myriads of individual data items and utilises special algorithms that aid visualisation of ‘emergent’ patterns and trends in the linkage. It complements conventional data mining methods, which assume the independence between the attributes and the independence between the values of these attributes. These techniques typically flag, alert or alarm instances or events that could represent anomalous behaviour or irregularities because of a match with pre-defined patterns or rules. They serve as ‘exception detection’ methods where the rules or definitions of what might constitute an exception are able to be known and specified ahead of time. Many problems are suited to this approach. Many problems however, especially those of a more complex nature, are not well suited. The rules or definitions simply cannot be specified. For example, in the analysis of transaction data there are no known suspicious transactions. This paper presents a human-centred network data mining methodology that addresses the issues of depicting implicit relationships between data attributes and/or specific values of these attributes. A case study from the area of security illustrates the application of the methodology and corresponding data mining techniques. The paper argues that for many problems, a ‘discovery’ phase in the investigative process based on visualisation and human cognition is a logical precedent to, and complement of, more automated ‘exception detection’ phases.
منابع مشابه
Artificial Intelligence for prediction of porosity from Seismic Attributes: Case study in the Persian Gulf
Porosity is one of the key parameters associated with oil reservoirs. Determination of this petrophysical parameter is an essential step in reservoir characterization. Among different linear and nonlinear prediction tools such as multi-regression and polynomial curve fitting, artificial neural network has gained the attention of researchers over the past years. In the present study, two-dimensi...
متن کاملNetwork data mining: methods and techniques for discovering deep linkage between attributes
Network Data Mining identifies emergent networks between myriads of individual data items and utilises special algorithms that aid visualisation of ‘emergent’ patterns and trends in the linkage. It complements conventional data mining methods, which assume the independence between the attributes and the independence between the values of these attributes. These techniques typically flag, alert ...
متن کاملApplication of Artificial Neural Networks and Support Vector Machines for carbonate pores size estimation from 3D seismic data
This paper proposes a method for the prediction of pore size values in hydrocarbon reservoirs using 3D seismic data. To this end, an actual carbonate oil field in the south-western part ofIranwas selected. Taking real geological conditions into account, different models of reservoir were constructed for a range of viable pore size values. Seismic surveying was performed next on these models. F...
متن کاملLimestone chemical components estimation using image processing and pattern recognition techniques
In this study based on image analysis, an ore grade estimation model was developed. The study was performed at a limestone mine in central Iran. The samples were collected from different parts of the mine and crushed in size from 2.58 cm down to 15 cm. The images of the samples were taken in appropriate environment and processed. A total of 76 features were extracted from the identified rock sa...
متن کاملPrediction of chronic kidney disease in Isfahan with extracting association rules using data mining techniques
Background: Millions of deaths occur around the world each year due to lack of access to appropriate treatment for chronic kidney disease patients. Given the importance and mortality rate of this disease, early and low-cost prediction is very important. The researchers intend to identify chronic kidney disease through the optimal combination of techniques used in different stages of data mining...
متن کامل